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It has been reported that exogenously introduced micro- 
RNA (exo-mlRNA) competes with endogenously ex- 
pressed mlRNAs (endo-mlRNAs) in human cells, resulting 
in a detectable upregulation of mRNAs with endo-mlRNA 
target sites (TSs). However, the detailed mechanisms of 
the competition between exo- and endo-mlRNAs remain 
uninvestigated. In this study, using 74 microarrays that 
monitored the whole-transcriptome response after intro- 
ducing mlRNAs or siRNAs into HeLa cells, we systemati- 
cally examined the derepression of mRNAs with exo- 
and/or endo-mlRNA TSs. 

We quantitatively assessed the effect of the number of 
endo-mlRNA TSs on the degree of mRNA derepression. As 
a result, we observed that the number of endo-mlRNA TSs 
was significantly associated with the degree of derepres- 
sion, supporting that the derepression resulted from the 
competition between exo- and endo-mlRNAs. However, 
when we examined whether the site proficiency of exo- 
mlRNA TSs could also influence mRNA derepression, to 
our surprise, we discovered a strong positive correlation. 
Our analysis indicates that site proficiencies of both exo- 
and endo-mlRNA TSs are important determinants for the 
degree of mRNA derepression, implying that the derepres- 
sion of mRNAs in response to exo-mlRNA is more com- 
plex than that currently perceived. 

Our observations may lead to a more complete under- 
standing of the detailed mechanisms of the competition 
between exo- and endo-mlRNAs and to a more accurate 
prediction of mlRNA targets. Our analysis also suggests 
an interesting hypothesis that long 3'-UTRs may function 
as molecular buffer against gene expression regulation by 
individual mlRNAs. 
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INTRODUCTION 

MicroRNA (nniRNA) is an approxinnately 22-nucleotide-long 
single-stranded RNA that regulates its target gene expression 
via the RNA-induced silencing connplex (RISC). RISC consists 
of nniRNA-loaded Argonaute and other proteins, which recog- 
nize specific targets by fornning connplennentary base-pairing 
with nnRNA targets and nnediate nnRNA destabilization and/or 
translational repression (Baek et al., 2008; Hutvagner and Si- 
nnard, 2008; Pratt and MacRae, 2009). Although nniRNA nnainly 
down regulates its target gene expression, recent studies have 
reported that nnany genes are also upregulated in response to 
exogenously introduced nniRNAs (exo-nniRNA) and siRNAs 
(Castanotto et al., 2007; Khan et al., 2009). These upregulated 
nnRNAs include target sites (TSs) for endogenously expressed 
nniRNAs (endo-nniRNAs), and the connpetition between exo- 
and endo-nniRNAs was proposed as a nnodel that nnay explain 
the observed pattern of nnRNA derepression (Castanotto et al., 
2007; Khan et al., 2009). 

However, the previous nnodel did not consider the potential 
interplay between exo- and endo-nniRNAs. Here, we hypothe- 
sized that exo-nniRNA TSs as well as endo-nniRNA TSs nnay 
play a role in deternnining the degree of derepression, since 
both of their regulatory activities are nnediated by the sanne 
RISC nnachinery. The potential coupling between exo- and 
endo-nniRNAs has not been carefully studied. In this study, we 
used 74 publicly available nnicroarrays that nneasured the 
whole-transcriptonne response after introducing nniRNAs or 
siRNAs into HeLa cells to investigate the connpetition between 
exo- and endo-nniRNAs. 

Based on the observations that we have nnade using the 
large dataset of nnicroarrays, we propose a new nnodel that 
nnay better explain the connplicated aspects of the coupled 
connpetition. Our new nnodel for the connpetition between exo- 
and endo-nniRNAs nnay be useful for the developnnent of a 
nniRNA target prediction tool with higher accuracy. We also 
report an interesting hypothesis that long 3'-UTRs nnight serve 
as effective nnolecular buffer that resists gene expression regu- 
lation by individual nniRNAs. 

MATERIALS AND METHODS 

Reference mRNAs 

To obtain a set of unique cDNAs, a database of hunnan nnRNA 
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sequences was constructed using the RefSeq database (Pruitt 
et al., 2000) and the hg19 hunnan genonne (Lander et al., 2001) 
fronn the UCSC Genonne Browser (http://genonne.ucsc.edu). 
The processing and filtering nnethods fronn a previous study by 
Baek et al. were applied in order to obtain the representative 
isofornn of each nnRNA (Baek et al., 2008). Briefly, these filter- 
ing nnethods include discarding any nnRNAs that contains the 
wrong open reading frannes (ORFs) or that are candidates for 
nonsense-nnediated decay. These non-redundant cDNAs were 
used as the reference cDNAs in the analysis. 

Analysis of microarrays 

To observe the change in nnRNA levels on a large scale, the 
published data of a previous study with 74 nnicroarrays was 
used in our analysis (Anderson et al., 2008; Birnninghann et al., 
2006; Garcia et al., 2011; Grinnson et al., 2007; Jackson et al., 
2006a; 2006b; Linn et al., 2005; Schwarz et al., 2006). These 
array data were generated to nneasure nnRNA level changes 
after exogenous nniRNA or siRNA was transfected (Lander et 
al., 2001) into hunnan HeLa cells. The changes in nnRNA ex- 
pression level of our reference cDNAs were assigned accor- 
dingly to the nneasured values depicted on the log2 scale fronn 
the previous study (Garcia et al., 2011). Then, the association 
between nnRNA expression level changes and the characteris- 
tics in the 3'-UTR such as the nniRNA TSs or its length were 
analyzed. 

Statistical analysis and identification of target sites 

In order to nneasure the nunnber of endo-nniRNA TSs fronn each 
nnRNA, ten highly expressed nniRNA in the hunnan HeLa cell 
line were selected fronn the published endo-nniRNA expression 
profile data (Landgraf et al., 2007). The nunnber of endo-nniRNA 
TSs was obtained by counting the nunnber of 8nner, 7nner-nn8, 
7nner-A1 , and 6nner sites, respectively on each 3'-UTR. nnRNAs 
without 6-8nner sites were regarded as "no site" nnRNAs. For 
regression analyses, the sinnple linear regression was per- 
fornned using the 'InnQ' connnnand in R package (http://www.r- 
project.org). To connpare the slopes of trend lines fronn different 
nnRNA subsets, ANCOVA (analysis of covariance) was per- 
fornned using the 'anovaQ' connnnand in R package. 

Target proficiency analysis 

To investigate the effect of exo-nniRNA TSs on the nnRNA de- 
repression, nnRNAs with >1 exo-nniRNA 8nner, 7nner-nn8, or 
7nner-A1 sites in the 3'-UTR were chosen and analyzed. After 
nneasuring the proficiency of exo-nniRNA 7-8nner sites with 
'context+ score' (Garcia et al., 2011), nnRNAs were sorted 
based on their 3'-UTR length and connpared with the predicted 
proficiency of two nnRNAs, whose 3'-UTR lengths were close 
(length difference of < 50 nucleotides). Then, these were sepa- 
rated into two groups depending on their relative proficiency of 
exo-nniRNA 7-8nner sites. In order to ensure that the two groups 
were not influenced by other factors, other than exo-nniRNA TS 
proficiency, the P values for the 3'-UTR length, nunnber of en- 
do-nniRNA TSs, and site proficiency of endo-nniRNA 7-8nner 
sites were calculated with Wilcoxon's rank-sunn-test using 'sci- 
py' package in Python (http://www.python.org). The data were 
fitted with a sinnple linear regression and the slopes of trend 
lines were connpared with ANCOVA. 

Analysis of derepression in mRNAs without exo- or endo- 
mlRNATSs 

To investigate whether the degree of derepression changes 
based on the different criteria for the selection of endo-nniRNAs, 



we analyzed nnRNAs without exo-nniRNA 6-8nner sites in the 3'- 
UTR. Fronn the published endo-nniRNA expression profile data 
in HeLa cells (Landgraf et al., 2007), the 10 nnost highly ex- 
pressed endo-nniRNAs were selected and a subset of nnRNAs 
was nnade by discarding any nnRNAs that contained 6-8nner 
sites for the 10 endo-nniRNAs in the 3'-UTR. The analysis was 
expanded to include the top 20 and the top 30 highly endo- 
nniRNAs as well in order to create a nnore robust set of nnRNAs 
without exo- or endo-nniRNA TSs. The correlation between the 
fold-change and the 3'-UTR length on a log scale was ob- 
served and they were graphically depicted with the nnean fold- 
change and the log scaled 3'-UTR length of 15,000 nnRNAs at 
each point. Then the data were fitted with the sinnple linear 
regression. 

RESULTS 

The target site proficiency of endogenously expressed 
mlRNAs influences mRNA derepression 

Using a large dataset of 74 nnicroarrays that nnonitored the 
whole-transcriptonne response after introducing nniRNAs or 
siRNAs into the HeLa cell line, we systennatically exannined the 
derepression of nnRNAs that have 3'-UTR TSs for the endo- 
nniRNAs. According to previous studies (Castanotto et al., 
2007; Khan et al., 2009), exo-nniRNA or siRNA connpetes with 
endo-nniRNAs and consequentially nnRNAs that have endo- 
nniRNA TSs are detectably upregulated. Consistent with the 
previous reports, we observed that nnRNAs with 3'-UTR 7-8nner 
sites for endo-nniRNAs exhibited a significant upregulation (Fig. 
1A). 

We hypothesized that if the observed upregulation or dere- 
pression was due to the connpetition between exo- and endo- 
nniRNAs, the nunnber of TSs should be associated with the 
degree of derepression. Thus, to quantitatively assess the ef- 
fect of the nunnber of 3'-UTR TSs for endo-nniRNAs on the 
degree of derepression, the correlation between thenn was 
nneasured. Since nnany species of endo-nniRNAs are ex- 
pressed in HeLa cells, it is difficult to identify nnRNAs without 
endo-nniRNA TSs. While a reasonable way to tackle this prob- 
lenn would be to take highly expressed endo-nniRNAs into ac- 
count, it is uncertain to deternnine how nnany endo-nniRNAs 
should be considered. Regardless of the nunnber of endo- 
nniRNAs that we chose, we noticed a strong positive correlation 
between the nunnber of endo-nniRNA TSs and their 3'-UTR 
lengths (Fig. IB), innplying that the 3'-UTR length can serve as 
a good predictor for the nunnber of endo-nniRNA TSs. As a 
result, we found that the degree of derepression was signifi- 
cantly associated with both the nunnber of endo-nniRNA TSs 
and the 3'-UTR length (Figs. 1C and ID). 

To further test whether the observed derepression was influ- 
enced by the presence or absence of exo-nniRNA 3'-UTR TSs, 
we collected a subset of nnRNAs without exo-nniRNA 3'-UTR 6- 
8nner sites and nneasured the correlation between the degree of 
derepression and the nunnber of endo-nniRNA 6-8nner sites. 
These nnRNAs exhibited a clear upregulation in connparison to 
the entire nnRNAs probably due to the absence of exo-nniRNA 
TSs (connpare Figs. 1C vs. IE and ID vs. IF). Furthernnore, a 
significant positive correlation was observed for this subset of 
nnRNAs, indicating that endo-nniRNA TSs nnay induce the 
nnRNA derepression independently of the presence or absence 
of exo-nniRNA TSs on the 3'-UTR (Figs. 1 E and 1 F). 

Then we observed the correlation between nnean fold-change 
and 3'-UTR length on a log scale annong nnRNAs without exo- or 
endo-nniRNA 6-8nner sites to verify that nnRNAs without 
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Fig. 1. The effect of the global competition between exo- and endo-miRNAs on derepression of endo-miRNA targets. (A) Among mRNAs 
without exo-miRNA 6-8mer sites, mRNAs with endo-miRNA 7-8mer sites (red) are significantly upregulated than those without endo-miRNA 6- 
8mer sites (blue). To count the number of endo-miRNA TSs, top 10 highly expressed endo-miRNAs in HeLa cell were considered (Landgraf et 
al., 2007). (B) Correlation between 3'-UTR length and the number of endo-miRNA 7-8mer sites. Top 10 (red), 20 (green), and 30 (magenta) 
highly expressed endo-miRNAs were considered to measure the number of endo-miRNA 7-8mer sites in each case. (C) Using all the mRNAs 
included in our 74 microarray data, fold-change values of mRNAs in response to exo-miRNAs or siRNAs were correlated with the number of 
endo-miRNA 7-8mer sites. (D) Fold-change values were correlated with 3'-UTR length. OthenA/ise as in (C). (E, F) For a subset of mRNAs 
without exo-miRNA 6-8mer sites, the same correlation analyses as in (C, D) were repeated. (G) For mRNAs without exo-miRNA 6-8mer sites 
(blue), the correlation between the mRNA fold-change and the 3'-UTR length on a log scale was analyzed (thick lines) and the simple linear 
regression lines are depicted (thin lines). After discarding mRNAs with 6-8mer sites in the 3'-UTR for the top 10 (red), 20 (green), and 30 (ma- 
genta) highly expressed endo-miRNAs, the same correlation analysis was conducted. Each point represents mean fold-change and the log 
scaled 3'-UTR length of 15,000 mRNAs. 
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Fig. 2. The effect of the local connpetition between exo- and endo- 
nniRNAs on derepression of endo-miRNA targets. (A) For nnRNAs 
with a single exo-nniRNA 8nner site, the correlation between nnRNA 
fold-change and 3'-UTR length was analyzed (red). The same 
correlation analysis was repeated for 7mer-m8 (orange), 7mer-A1 
(green), 6mer (blue), and no site (magenta). (B) mRNA fold-change 
values were correlated with log values of 3'-UTR length (thick lines) 
and their simple linear regression lines (thin lines) are shown to- 
gether. OthenA/ise as in (A). (C) For each pair of mRNAs that have 
nearly identical 3'-UTR, one with a stronger proficiency (measured 
by the sum of context+ scores) of exo-miRNA 7-8mer sites was 
classified to be the "stronger exo-miRNA 7-8mer sites" group, while 
the other was classified as "weaker exo-miRNA 7-8mer sites" group. 
A pair of mRNAs with the smallest difference in their 3'-UTR length 
was iteratively chosen until the 3'-UTR length difference became 
longer than 50 nucleotides. For each of the "stronger exo-miRNA 
TSs" and "weaker exo-miRNA TSs" group, mRNA fold-change 
values were correlated with log values of 3'UTR length. OthenA/ise 
as in (B). 



these TSs are not influenced by nniRNA or siRNA transfection. 
As we took a larger number of endo-miRNAs into consideration, 



the slopes of the linear regression lines were closer to zero, 
indicating that nnRNAs without exo- or endo-nniRNA TSs are 
not affected by nniRNA or siRNA transfection (Fig. 1G). These 
nnRNAs were slightly downregulated probably because we only 
considered TSs in 3'-UTR while sonne ORF TSs of the exo- 
nniRNA can also be functional (Baek et al., 2008; Fornnan et al., 
2008). 

Taken together, these results support the previous observa- 
tion that the connpetition between exo- and endo-nniRNAs nnay 
cause the derepression of endo-nniRNA target nnRNAs and the 
range of the derepression is broad enough to influence exo- 
nniRNA targets as well as non-targets. 

The target site proficiency of exogenously introduced 
miRNAs also influences mRNA derepression 

To exannine whether the site type of an exo-nniRNA TS can 
influence nnRNA derepression, we investigated nnRNAs that 
included a single 3'-UTR 8nner site of exo-nniRNA but no other 
exo-nniRNA TSs. As shown in Fig. 2A, a strong association 
between the nnean nnRNA fold-change and 3'-UTR length was 
observed. nnRNAs with longer 3'-UTRs tended to exhibit a 
weaker response to the exo-nniRNA, probably due to a stronger 
derepression by a larger nunnber of endo-nniRNA TSs. For 
instance, the nnean fold-change of nnRNAs with short 3'-UTRs 
(< 500 nucleotides (nt)) was -0.40 while that of nnRNAs with 
long 3'-UTRs (> 4,000nt) was only -0.06, dennonstrating a 
strikingly weaker response of the latter connpared to the fornner 
(fold-change difference = -0.40-(-0.06) = -0.34). 

However, when looking at nnRNAs that included a single 3'- 
UTR 7nner-nn8 site for exo-nniRNA but no other exo-nniRNA TSs, 
the nnean fold-change between the short and long 3'-UTR 
groups (< 500 nt vs. > 4000 nt) was relatively close (Fig. 2A, 
fold-change difference = -0.21-(-0.03) = -0.18). Sinnilarly, the 
nnean fold-change differences becanne snnaller as we looked at 
weaker site types (Fig. 2A, fold-change differences = -0.13, - 
0.02, and -0.06 for 7nner-A1 , 6nner, and no site, respectively). 

A linear regression between the nnean nnRNA fold-change 
and the 3'-UTR length on a log scale was perfornned to test 
whether the slopes were significantly different depending on the 
site type of exo-nniRNA sites (Fig. 2B). We found that all four 
site types presented significantly different regression slopes in 

, < 
and 

6nner, respectively, Ftest). 

To further control for potential confounding factors such as 
the difference in the site proficiency for endo-nniRNA TSs, we 
chose two groups of nnRNAs that have nearly identical 3'-UTR 
lengths, but one of the group of nnRNAs had a higher site profi- 
ciency of exo-nniRNA 7-8nner sites than the other. Thus, we 
ainned to control for the overall site proficiency of endo-nniRNA 
TSs and to assess the effect of the site proficiency of exo- 
nniRNA TSs on the degree of nnRNA derepression. Indeed, the 
nnedian length, the nunnber of endo-nniRNA 7-8nner sites, and 
the sunn of context+ scores of endo-nniRNA 7-8nner sites be- 
tween the two groups were not significantly different (P = 0.99, 
0.77, and 0.85, respectively, Wilcoxon's rank-sunn test). On the 
other hand, the group with stronger exo-nniRNA TSs showed a 
significantly stronger correlation between nnean nnRNA fold- 
change and 3'-UTR length in connparison to the other group 
with weaker exo-nniRNA TSs (Fig. 20, P= 1.4 x 10"'\ Ftest). 
Our analysis indicates the site proficiency of not only endo- 
nniRNA TSs but also exo-nniRNA TSs are innportant deternni- 
nants for the degree of derepression of endo-nniRNA target, inn- 
plying that the nnRNA derepression in response to exo-nniRNAs 



connparison to the no site group (P< 1.0 x 10"'^ < 1.0 x 10"' 
1.0 X 10"^^, and 2.0 x 10"^^ for 8nner, 7nner-nn8, 7nner-A1, 
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Fig. 3. A proposed nnodel to explain the effect of global and local connpetition between exo- and endo-miRNAs on the derepression of endo- 
nniRNA targets. (A) mRNAs without exo- or endo-nniRNA TSs on their 3'-UTRs are not affected by nniRNA or siRNA transfection. (B) mRNAs 
with endo-nniRNA TSs but no exo-miRNA TS show a moderate degree of derepression caused by the global competition between exo- and 
endo-miRNAs. (C) mRNAs with both exo- and endo-miRNA TSs show a strong derepression, resulting from both the global and local competi- 
tion between exo- and endo-miRNAs. 



or siRNA is nnore connplex than that currently perceived. 
DISCUSSION 

A previously proposed nnodel for the derepression of nnPNAs in 
response to exo-nniRNAs or siRNAs suggested that exo- 
nniRNAs or siRNAs connpete with endo-nniRNAs, resulting in 
detectable upregulation of nnRNAs with endo-nniRNA TSs 
(Khan et al., 2009). Using a large dataset of nnicroarrays that 
nnonitored the whole-transcriptonne response after introducing 
nniRNAs or siRNAs into HeLa cells, we systennatically ex- 
annined the derepression of nnRNAs with exo- and endo- 
nniRNA TSs. By quantitatively assessing the effect of the nunn- 
ber of endo-nniRNA TSs in the 3'-UTR on the degree of dere- 
pression, we found a significant association between the nunn- 
ber of endo-nniRNA TSs and the degree of derepression, sup- 
porting that the derepression resulted fronn the connpetition 
between exo- and endo-nniRNAs. Furthernnore, we discovered 
a strong positive correlation between the site proficiency of exo- 
nniRNA TSs and the degree of derepression, indicating that the 
site proficiencies of both exo- and endo-nniRNA TSs are innpor- 
tant deternninants for the degree of derepression. 

Based on our analysis, we propose a new nnodel that nnay 
explain the connplicated nature of the connpetition between exo- 
and endo-nniRNAs. When an exo-nniRNA is transfected into a 
hunnan cell line, the exo-nniRNA will connpete against the endo- 
nniRNAs for loading onto the Argonaute (Ago) proteins. Sonne 
of the endo-nniRNA-loaded Ago proteins will release the loaded 
endo-nniRNA and bind to an exo-nniRNA, thus decreasing the 
concentration of endo-nniRNA-loaded Ago proteins. Here, 
nnRNAs without exo- or endo-nniRNA TSs are not influenced by 
this global connpetition between exo- and endo-nniRNAs (Fig. 
3A). On the other hand, nnRNAs with endo-nniRNA TSs will be 
derepressed because sonne of the Ago proteins bound to endo- 
nniRNA TSs will be outconnpeted by the high concentration of 
exo-nniRNA (Fig. SB), consistent with the previously proposed 
nnodel (Khan et al., 2009). 



In addition to this "global connpetition" nnodel, we suggest 
another dinnension of the connpetition, where endo-nniRNA TSs 
are located on the sanne 3'-UTR as the exo-nniRNA TSs and 
they connpete against each other. In this case, sonne of the Ago 
proteins bound to endo-nniRNA TSs will be outconnpeted by the 
high concentration of exo-nniRNAs and subsequently bind to 
exo-nniRNA TSs adjacent to the endo-nniRNA TSs, due to the 
short spatial distance between exo- and endo-nniRNA TSs (Fig. 
3C). This "local connpetition" will beconne nnore pronounced 
when the exo-nniRNA TS is nnore proficient; as a nnore profi- 
cient exo-nniRNA TS binds with Ago proteins nnore frequently 
and tightly, the chance that the Ago-unbound endo-nniRNA TSs 
get to bind again to another Ago protein decreases, resulting in 
a stronger overall derepression (Fig. 3C). 

Our current nnodel for the connpetition between exo- and en- 
do-nniRNAs is oversinnplified because it does not consider other 
regulatory elennents that are ennbedded in 3'-UTRs. Especially, 
long 3'-UTRs are nnore likely to include other regulatory ele- 
nnents and highly structured 3'-UTR regions. As a result, nniR- 
NA targeting on these 3'-UTRs nnay be nnore connplicated and 
coupled with other post-transcriptional regulations, which nnight 
be responsible for the stronger derepression that we observed. 
To clearly dissect the effects of the highly entangled regulatory 
elennents, future efforts to revisit this issue using cells without 
endo-nniRNAs are warranted. 

Our analysis on long 3'-UTRs also raises an interesting hypo- 
thesis. Long 3'-UTRs nnay escape gene expression regulation 
by individual nniRNAs. As dennonstrated in Fig. 2, long 3'-UTRs 
showed relatively unchanged nnRNA expression even in 3'- 
UTRs that have highly proficient exo-nniRNA TSs. This is prob- 
ably because of the stronger connbined derepression by a larg- 
er nunnber of endo-nniRNA TSs. nnRNAs with unusually long 3'- 
UTRs, sonne of which are even longer than their ORFs, exist 
but their biological functions rennain largely uninvestigated. Our 
observation of the coupling between 3'-UTR length and the 
efficacy of nniRNA targeting suggests that long 3'-UTRs nnight 
serve as effective nnolecular buffer that resists gene expression 
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regulation by individual nniRNAs. 

Understanding the connplicated nature of the competition be- 
tween exo- and endo-nniRNAs nnay help us draw a nnore conn- 
plete picture of nniRNA targeting rules. For instance, one would 
be able to connputationally nnodel the connpetition between exo- 
and endo-nniRNAs as well as the coupling between 3'-UTR 
length and the efficacy of nniRNA targeting. These future efforts 
nnay lead to innprove the accuracy of nniRNA target prediction 
and reduce off-targeting by siRNA and shRNA. 
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